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(57) Abstract 



A portable communication device (10) is controlled by voice recognition circuitry (20) remote from the portable communi- 
cation device. The portable communication device includes apparatus for producing and transmitting a parametric representation 
of voice commands. The remote circuitry (which could possibly be a base station, a mobile repeater, or simply a dedicated box, 
separate from the portable) produces control signals, responsive to the parametric representation of voice commands, for con- 
trolling the portable communication device. 



FOR THE PURPOSES OF INFORMATION ONLY 

Code* used to identify Stale*, party to the PCT on the fioni pages of pamphlet* publishing international 
applications undei the PCT. 



AT 


Austria 


Fl 


t'inlanJ 


All 


Australia 


FR 


h ranee 


BB 


Barbado* 


CA 


Ci abon 


BE 


Belgium 


CB 


United Kingdom 


BP 


Burkina Fa*o 


CN 


Guinea 


BC 


Bulgaria 


CR 


Greece 


BJ 


Bent it 


HU 


Hungjry 


BR 


Brazil 


IE 


Ireland 


CA 


( anatfa 


IT 


llal> 


CF 


Central African Republic 


JP 


Japan 


CC 


Congo 


KP 


Democratic People's Republic 


CH 


SwtL/ci land 




of Korea 


CI 


Cole d*l voire 


KR 


Republic of Korea 


CM 


Cameroon 


LI 


I icchlenMeJn 


CS 


Czechoslovakia 


LK 


Sri l^iitka 


DE 


(iermuny 


LU 


Luxembourg 


UK. 


Denmark 


MC 


Monaco 


ES 


Spain 


MC 


Madagascar 



Ml 


Mali 


MN 


Mongolia 


MR 


Mauritania 


MW 


Malawi 


NI- 


Netherlands 


NO 


Norway 


PL 


Puland 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SN 


Senegal 


su 


Soviet Unton 


TD 


Oud 


TC 


Togo 


US 


United Stales of America 



WO 93/01664 



PCI7US92/05577 



1 



5 



10 

REMOTE VOICE CONTROL SYSTEM 

Technical Field 
1 5 This invention relates generally to radio communication 

systems, and more specifically, to radio communications systems 
using digital signal processing (DSP) and voice control. 

Background 

20 As portable radios decrease in size, and increase in the 

number of available options, it becomes more difficult to supply 
sufficient keyboard and display functions due to size, cost, 
complexity, and current drain. An alternate method that could use 
a smaller keypad, but still control and increasing number of 

25 options would thus be desirable. A voice recognition system 

could accomplish this control with a minimal keypad, but it would 
require complicated digital signal processing circuitry that is 
expensive, and consumes significant amounts of current, both 
highly undesirable in a portable radio. Accordingly, a need exists 

30 for a voice-controlled portable radio that avoids the detriments of 
the prior art. 

Summary of the Invention 

Briefly, in accordance with the invention, a portable 
35 communication device is controlled by voice recognition circuitry 
remote from the portable communication device. The portable 
communication device includes means for transmitting a signal 
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representing voice commands. The remote circuitry produces 
control signals, responsive to the signal representing voice 
commands, for controlling the portable communication device. 
Thus, a voice-operated portable communication device is 
5 provided that overcomes the detriments of the prior art. 
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Brief Description of the Drawings 

FIG. 1 is a block diagram of a vehicular repeater system in 
accordance with the invention. 
1 o FIG. 2 is a block diagram of another communication system 

in accordance with the invention. 

FIG. 3 is a block diagram of a further communication 
system in accordance with the invention. 

FIG. 4 is a block diagram of a portable radio 
1 5 communication device in accordance with the invention. 

FIG. 5 is a functional block diagram of a communication 

system in accordance with the invention. 

FIG. 6 is block diagram of a base station repeater in 
accordance with the invention. 

20 

Detailed Description of the P referred Embodiments 

Referring to FIG. 1 , there is shown a vehicular repeater 
system in accordance with a preferred embodiment of the 
invention. The system comprises a portable radio 10, a voice- 

25 controlled mobile repeater unit 12 and a base 14. In this 
embodiment, the voice recognition circuitry is in a voice 
recognizer 20 located in the mobile repeater unit 12. The mobile 
repeater unit also comprises a mobile radio transceiver 18, an 
auxiliary transceiver 16, and a microprocessor controller 22. 

30 Signals are transmitted by the portable radio 10, received by the 
vehicular repeater unit 12, and repeated to the base station 14. 

Access to the voice recognizer 20 is accomplished over the 
normal transmission of the portable 10. In accordance with the 
invention, a user of the portable 10 utters voice commands and 

35 the portable transmits a representation of the voice commands, 
i.e., voice-modulated or digitally coded voice signals. If the 
portable 10 uses a digital vocoder, the parametric representation 
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is the digitally-coded speech. Speech coding parameters are 
well-known in the art, examples of which are found in LR. 
Rabiner and R.W. Schafer, Digital Processing of Speech Signals, 
Prentice-Hall. 1978, at Chapters 6-8. Digitally coded speech often 
5 uses linear predictive coding (LPC) parameters, which are also 
used in speech recognition. See LR. Rabiner and S.E. Levinson, 
"Isolated and Connected Word Recognition - Theory and 
Selected Applications," IEEE Transactions On Communications, 
Vol. Com.-29, May 1981, at pp. 621-659. Therefore, a recognizer 

1 0 receiving digitally coded speech does not extract features, but 
can obtain them directly from the encoded speech. 

A radio-frequency (RF) link is preferred for sending the 
commands and data back and forth from the mobile unit 12 to the 
portable 10. Thus, the parametric representation of commands 

1 5 that are given in a voice message from the user of the portable 
unit 10 will be received by the mobile repeater unit 12 and 
"recognized" as command words. The mobile repeater unit 12 
then interprets the command and sends an acknowledgement of 
the command and a data/control message back to the portable 

20 10, in response to the voice command. 

In the vehicular repeater system, the voice recognizer 20 
would accept speech input from two sources: a microphone in 
the vehicle and from the- portable (through the repeater's auxiliary 
transceiver 16). Thus, the portable radio user can control the 

25 mobile radio transceiver 18 while in the vehicle (i.e., using the 
microphone), or through the portable 10, while outside of the 
vehicle. The state of the mobile repeater unit 12 could be sent to 
the portable 10 with synthesized voice and subaudible signalling, 
or a data packet could be sent to update or control the portable 

30 10. When the portable 10 talks to the mobile repeater unit 12, 
different subaudible signals could differentiate the voice 
commands from normal conversation. 

Referring to FIG. 2, the portable radio 104 operates as a 
talk/listen radio to the base 108, and includes added capabilities 

35 for exchanging data, voice and control signals with the voice 
recognizer 102. A remote portable 104 sends voice to the base 
108 and voice commands to the recognizer 102. A connection 



WO 93/01664 



PCT/US92/05577 



using any of several communication media such as RF, fiber 
optic, or hard-wired connection may be made between the base 
108 and remote recognizer 102. Thus, if desired, information may 
be transferred between the base 108 and the recognizer 1 02. 
5 Different subaudible signals could differentiate the voice 

commands from normal conversation. The voice recognizer 102 
determines the voice commands given through the portable 104, 
and sends back the necessary control signals to effect the control 
in the portable 104. Through this pairing, voice control of the 

1 0 portable 1 04 (or any unit connected to the recognizer or portable) 
is possible without the need of voice control circuitry in the 
portable 104 itself, requiring almost no additional complexity in 
the portable 104, A simple microprocessor in the portable 104 
would be able to decode data packets, and execute the required 

1 5 control functions. Several portables could be serviced by a single 
remote recognizer. 

Referring to FIG. 3, in a third possible embodiment, a voice 
recognizer 206 is coupled to a base station 202 via a 
microprocessor 204. In this embodiment, both the base 202 and 

20 the portable 200 can be remotely controlled by voice commands. 
The same control/data packets will be required from the base 202 
as were required from the remote voice recognizer 102 in the 
previous embodiment. However, if the status of the base 202 is 
altered, some additional acknowledgements/data may be needed 

25 to inform the user of the change. 

A further benefit obtained by placing the voice recognizer 
102 in or near the base 202 is that the voice recognizer 206 can 
now be shared by multiple remote units over the entire coverage 
range of the base station 202. If the incoming voice commands 

30 are coded with the identity of the sender, then control sequences 
can be sent back to that individual sender. Thus, the recognition 
circuitry can be used at maximal efficiency. Fleet management 
functions could also provide control for a pre-defined group or 
fleet of portables. 

35 Referring to FIG. 4, there is shown a block diagram of a 

portable radio communication device 300, in accordance with the 
invention. When the portable radio communication device 300 is 



in the receive mode, the transmit/receive switch 302 connects the 
antenna to a receiver section 304 which converts the received RF 
signals to either intermediate frequency (IF) or audio frequency 
signals. The output of the receiver section 304 is provided to an 
anti-aliasing filter 306 for filtering out of frequencies outside the 
desired sampling range. An analog-to-digital converter 308 
converts the analog output of the anti-aliasing filter 306 to a digital 
signal suitable for processing by a digital signal processor 
(DSP)/controller 310. The DSP/controller 310 performs the 
desired processing of the received signals. The received signals 
may include both a data component and a non-data component 
(e;g., conversation). In the event that the data portion includes 
control signals, the appropriate control signals are are decoded 
and implemented in the portable 10. The non-command 
Information received is presented to the user of the portable by 
converting the processed signal to analog form with a digital-to- 
analog converter 31 4. The signal is filtered (31 6) to remove 
undesired components, and amplified (318) and converted to 
sound by the speaker. 

There are two sub-modes in the transmit mode: a first sub- 
mode is triggered by activating a push-to-talk (PTT) switch 346, 
and the second mode is triggered by activating a push-to- 
recognize (PTR) switch 348. The two user-activated switches 346 
and 348 are coupled to the DSP/controller 31 0 to place the 
portable 300 in the desired mode. The second mode is a voice 
recognition mode that can be added to a portable to enhance 
voice recognition operation. The second mode can also 
(alternatively) be activated remotely using control of the voice 
recognition unit by providing for continuous recognition of the 
signals transmitted by the portable, to trigger the voice command 
recognition mode. 

In the voice command recognition mode (or PTR) mode, 
the portable is in a mode whereby the DSP and controller 310 
"knows" that it is receiving voice commands, and accordingly may 
use a faster sampling rate so that additional processing may be 
done on the voice command signals. Since the controller knows 
that a command word is being uttered, it may implement a 
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different sampling rate to obtain voice samples and perhaps a 
different coding scheme to transmit the command word to the 
base. This coding scheme need not require transmission of the 
command words in "real time." By processing the command 
5 words differently than regular speech, the fidelity of the utterance 
can be preserved, thus producing a better recognition decision. 
The use of the push-to-recognize switch 348 also eliminates the 
need for the speech recognition apparatus to identify command 
signals. 

10 In the PTT mode, the portable receives voice from the 

microphone 344 for voice conversation as in a normal push to talk 
radio. The voice is amplified by a pre-amplifier 326 and applied 
to an anti-aliasing filter 324. The filtered output of the anti- 
aliasing filter 324 is applied to an A/D converter 322, and the 

1 5 resulting digital signal is applied to the DSP/Controller 310 for 
processing. The DSP/Controller 310 applies the processed 
signals to a D/A converter 342 for conversion to analog form. The 
output of the D/A converter 342 is applied to a reconstruction filter 
340 for removing artifacts (e.g., transients) resulting from the D/A 

20 conversion. The resulting signal is applied to a modulation 
section 338. The modulated signal is amplified by a power 
amplifier 336, and filtered by filter 334 before transmission. 

The DSP/controller 310 may be a known DSP integrated 
circuit alone or combined with a conventional microprocessor, 

25 controlled by software to extract speech recognition features form 
the sampled speech. In the PTR mode, the DSP and controller 
31 0 produces a parametric representation of the voice commands 
(e.g., by extracting speech recognition features from the speech 
received from the user of the portable 10) from the microphone 

30 344. The parametric representation of the voice commands is 
then applied to a modulator 338 through a data filter 328. The 
modulator 338 could use any form of modulation (e.g., A.M., F.M., 
or quadrature AM modulation, i.e., QAM). The resulting 
modulated signal is applied to the power amplifier 336 for 

35 amplification to a level suitable for over-the-air transmission. The 
resulting amplified signal is filtered at a transmit filter for removal 
of undesired harmonics from the RF signal to be transmitted. 
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The PTR switch could enable the addition of tone coded 
squelch or digital coded squelch signals to the modulation 
section 338 via the data filter 328 to both identify and differentiate 
the two transmit modes. In this way, the receiving unit would 
5 know when to repeat the voice message or when to look for a 
voice command. The PTT and the PTR would use different tones 
or codes to differentiate between them. 

The addition of the second transmit mode is an 
improvement to the basic idea. This would add three important 
10 improvements to the communication systems shown in the 
previous figures. 

First, it would limit the recognizer to verbal inputs that were 
strictly commands. This way, normal verbal messages used in 
communication to other people would not cause an improper 
1 5 response from the recognizer if it happened to sound similar to a 
voice command. In fact, the portable could be simplified from 
those of today while providing new enhanced features. 

Second, the voice commands that were give to control the 
unit would not be broadcast to others listening to the radio 
20 system. 

Third, the voice commands could be processed differently, 
in the portable and at the base, from normal conversation to 
improve the remote recognition accuracy. 

In the case of the Mobile Repeater, the recognizer built into 

25 the mobile radio would normally be used to control the mobile 
when the user was in the car. Since the car and the mobile 
microphone have certain acoustical properties, the recognizer will 
be designed for this case. Now when the portable is used outside 
the car to operate the mobile radio, the activation of the PTR 

30 switch during the issuance of verbal commands will allow the 
mobile recognizer to adjust various parameters of the voice 
recognition system to enhance the recognition of the detected RF 
signal containing the voice of the remote user. This allows the 
frequency response of the portable to be taken into account in the 

35 recognition process. 

Adding a Voice Synthesizer to the remote recognizer unit 
will allow the acknowledgements to voice commands to be made 



WO 93/01664 



PCT/US92/05577 



verbally. In this way the user could request a status of what 
channel is being used or receive a verbal acknowledgement to 
channel changes etc. In this approach, a display may not be 
needed. 

5 The portable could be used for the repeater case as a 

wireless control head while inside the car. In this case it would be 
desirable to avoid transmitting inside the vehicle and perhaps 
overdriving the nearby receiver. Further, if considered that there 
would only be a limited number of frequencies for the portable to 

1 0 mobile link, it would seem to be undesirable to have people 

inside cars using up these links. By using an Infra-Red channel, 
the battery could be saved from the normal current drain of the 
portable transmitter if the infra-red transmit power is lower than 
the RF transmit power If the IR link between the portable and the 

1 5 mobile interface were to fail, the RF transmission could be 

resumed. If desired, an ultra-sonic link could be used as an added 
method of communication between the portable and the mobile 
unit inside the car. 

Referring to FIG. 5, there is shown a functional block 

20 diagram of a simplified communication system 400 in accordance 
with of the invention. A portable radio 402 is similar to portable 
radio 300 but simplified in that there are no DSP or A/D 
converters. A base station repeater 430 includes the remote 
voice recognition circuitry in accordance with the invention. 

25 Basically, the portable radio 402 receives voice commands from 
the user and transmits RF signals modulated with the voice 
signals. The base station repeater 430 receives the voice 
modulated signals and transmits control signals back to the 
portable 402. 

30 The portable 402 can affect control of the base unit 430 to 

recognize voice commands preferably by sending the proper low 
speed coded squelch tone or digital word, or other subaudible 
signal, transmitted continuously with voice the commands. 
Alternatively, a high speed data packet could be sent, causing the 

35 base 430 to enter the voice-recognition mode for a predetermined 
time interval or until the proper low speed coded squelch tone, 
digital squelch word, or another data packet is received by the 
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base 430. Moreover, the control signals can be low speed data, 
or high speed data packets or a combination. 

When the portable 402 is in the receive mode, it receives 
from the base, voice-modulated signals and control signals. 
5 When the PTR switch is activated voice commands received at 
the microphone 420 are amplified (419) and applied to a 
modulator 416. The microprocessor 412 may add low speed data 
modulation to the signal being modulated. 

Referring to FIG. 6, there is shown a block diagram of a 

1 0 base station repeater 430 in accordance with of the invention. 
The base station includes a receiver 432, which receives signals 
to be repeated by a transmitter 434. The receiver 432 provides 
an audio signal representative of the the received signal to an 
audio filter 436 which conditions the audio signal for processing 

15 by a voice recognizer 444. A delay/switch block 442 is under the 
control of the microprocessor controller 440, and when enabled, 
causes the audio information to pass from the audio filter 436 to 
the transmit modulator 446. Thus, the audio is repeated by the 
transmitter. If two switches, i.e., a PTT and PTR switch, are used 

20 to differentiate the two different modes, a delay is not needed, and 
block 442 only serves to mute and unmute the transmit audio. If 
there is no PTR switch (for a given implementation) on the 
portable, and only key words are used to differentiate the modes, 
then a delay would be desirable in addition to the switch function 

25 in block 442. 

The delay/switch block 442 could be used to delay the 
radio signal while it is analyzed for "key words" by the voice 
recognizer 444. The delay/switch block 442 breaks the path of 
the audio, thereby muting the key word and following command 

30 words from being repeated. This muting process remains in effect 
for a predetermined time, or until the input signal is lost, another 
key word ends the voice commands, or by data signalling from 
the portable unit, or by control of the microprocessor controller 
440. 

35 The switch/delay block 442 could work in several ways. 

First, a constant delay could be used all the time. However, such 
delay would be undesirable to users. Second, if key words are 
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only given in the beginning of a transmission, the delay can start 
at a given value and then be gradually reduced after the first few 
seconds. This reduction would reduce the delay to zero or near 
zero. Third, there can be no delay at all. In this method, the key 
5 word would be heard by other users of the communication 
system, but the following words would be muted. 

The data filter 438 prepares the received data to be 
sampled by the microprocessor controller 440. This data could 
be sub audible tone encoded or digitally encoded squelch words, 

1 0 different codes or tones being used for the purposes of indicating 
the mode of the portable (i.e., PTT mode or PTR mode) or for high 
speed data packets used for receiving data and control packets. 

The delay is not required in delay/switch block 442 when a 
switch is used to manually enter the PTR mode. 

1 5 The microprocessor controller 440 can also send data to 

the portable by passing signals through the data filter 448 prior to 
being applied to the modulator 446. Both low speed and high 
speed data could be sent via this path. 

The microprocessor controller 440 can also respond to 

20 commands of the portable or to give update messages by 
activating a voice synthesizer 450 which generates an audio 
"voice" signal which is applied to the modulator 446. 
A data filter 438 filters the audio output of the receiver 432 for 
processing by a microprocessor controller 440. 

25 Thus, there is provided a voice-controlled portable radio 

that avoids the detriments of the prior art by using speech 
recognition apparatus at a remote location to provide the 
controlling signals to the portable radio, over a communication 
medium. 



30 
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Claims 

1. In a communication system including a voice 
recognition apparatus, a portable radio remote from the voice 
recognition apparatus, the portable radio comprising: 
5 transceiver means for receiving and transmitting signals; 

controller means, coupled to the transceiver means; 
means for receiving voice commands; 
the transceiver means comprising: 

means for transmitting a representation of the voice 
1 0 commands to the voice recognition apparatus; and 

means for receiving control signals from the voice 
recognition apparatus, the means for receiving control signals 
being responsive to the representation of the voice commands, 
and being coupled to the controller means for providing the 
15 control signals thereto. 
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2. The portable radio of claim 1 , wherein the 
representation of the voice commands is parametria 

5 3. The portable radio of claim 1 , wherein the 

representation of the voice commands comprises recognition 
features of the voice commands. 

4. The portable radio of claim 1 , further comprising: 
1 o first means for placing the portable radio in a conventional 

transmit mode when the first means is activated; and 

second means for placing the portable radio in a voice 
command transmit mode when the second means is activated. 

15 5. The portable radio of claim 4, wherein the second 

means comprises a switch. 

6. The portable radio of claim 4, wherein the second 
means is responsive to voice commands. 
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7. A communication system comprising: 

voice recognition means for receiving parametric 

representations of voice commands, and for providing control 
5 signals responsive to the parametric representations of voice 

commands; and 

a portable radio comprising: 

transceiver means for receiving and transmitting signals; 
controller means, coupled to the transceiver means, for 
1 0 controlling the operation of the transceiver means; 
means for receiving voice commands; 
means for producing a parametric representation of the 
voice commands; 
the transceiver means comprising: 
1 5 means for transmitting the parametric representation of the 

voice commands to the voice recognition apparatus; and 

means for receiving control signals, responsive to the 
parametric representation of the voice commands, from the voice 
recognition apparatus, the means for receiving control signals 
20 being coupled to the controller means for providing the control 
signals thereto. 
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8. The portable radio of claim 7, further comprising a 
vehicular repeater comprising the voice recognition means. 

9. The portable radio of claim 8, further comprising a base 
5 coupled to the voice recognition means. 

10. The portable radio of claim 9, further comprising at 
least a second portable communication unit coupled to the voice 
recognition means. 
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FIG. 2 
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